Performance Evaluation of Learning Classifiers for Speech Emotions Corpus using Combinations of Prosodic Features

نویسندگان

  • Syed Abbas Ali
  • Sitwat Zehra
  • Afsheen Arif
چکیده

This paper introduces the speech emotion corpus, a multilingual speech emotion database recorded in the provincial languages of Pakistan: Urdu, Punjabi, Pashto and Sindhi for analyzing the speech emotions present in the recorded speech signals with the four different emotions (Anger, Sadness, Comfort and Happiness). The objective of this paper is to evaluate the performance of the learning classifiers (MLP, Naive Bayes, J48, and SMO) for speech emotion corpus recorded in the provincial languages of Pakistan with different combinations of prosodic features in term of classification accuracy and time taken to build models. The experimental results clearly show that the J48 classifier performs far better than all other classifiers in term of both classification accuracy and model building time. SMO indicates slightly better classification accuracy than Naïve Bayes classifiers whereas; Naïve Bayes exhibit minimum model building time as compared to MLP.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Database for Automatic Persian Speech Emotion Recognition: Collection, Processing and Evaluation

Abstract   Recent developments in robotics automation have motivated researchers to improve the efficiency of interactive systems by making a natural man-machine interaction. Since speech is the most popular method of communication, recognizing human emotions from speech signal becomes a challenging research topic known as Speech Emotion Recognition (SER). In this study, we propose a Persian em...

متن کامل

Characterization of Emotions Using the Dynamics of Prosodic Features

In this paper the dynamics of prosodic parameters are explored for recognizing the emotions from speech. The dynamics of prosodic parameters refer to local or fine variations in prosodic parameters with respect to time. The proposed dynamic features of prosody are represented by : (1) sequence of durations of syllables in the utterance (duration contour), (2) sequence of fundamental frequency v...

متن کامل

Classifier Subset Selection for the Stacked Generalization Method Applied to Emotion Recognition in Speech

In this paper, a new supervised classification paradigm, called classifier subset selection for stacked generalization (CSS stacking), is presented to deal with speech emotion recognition. The new approach consists of an improvement of a bi-level multi-classifier system known as stacking generalization by means of an integration of an estimation of distribution algorithm (EDA) in the first laye...

متن کامل

Automatic discrimination between laughter and speech

Emotions can be recognized by audible paralinguistic cues in speech. By detecting these paralinguistic cues that can consist of laughter, a trembling voice, coughs, changes in the intonation contour etc., information about the speaker’s state and emotion can be revealed. This paper describes the development of a gender-independent laugh detector with the aim to enable automatic emotion recognit...

متن کامل

Emotion recognition based on phoneme classes

Recognizing human emotions/attitudes from speech cues has gained increased attention recently. Most previous work has focused primarily on suprasegmental prosodic features calculated at the utterance level for modeling against details at the segmental phoneme level. Based on the hypothesis that different emotions have varying effects on the properties of the different speech sounds, this paper ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013